Abstract: In large asexual populations, beneficial mutations have to compete
with each other for fixation. Here, I derive explicit analytic expressions
for the rate of substitution and the mean beneficial effect of fixed mutations,
under the assumptions that the population size N is large, that the mean
effect of new beneficial mutations is smaller than the mean effect of new
deleterious mutations, and that new beneficial mutations are exponentially
distributed. As N increases, the rate of substitution approaches a constant,
which is equal to the mean effect of new beneficial mutations. The mean
effect of fixed mutations continues to grow logarithmically with N. The
speed of adaptation, measured as the change of log fitness over time, also
grows logarithmically with N for moderately large N, and it grows
double-logarithmically for extremely large N. Moreover, I derive a simple formula
that determines whether at given N beneficial mutations are expected to
compete with each other or go to fixation independently. Finally, I verify all
results with numerical simulations.
INTRODUCTION

In asexual populations, beneficial mutations that have arisen independently
in different organisms cannot recombine and therefore have to compete
for fixation. This effect, often referred to as clonal interference (Gerrish and Lenski 1998),
leads to a slowdown of adaptation for large population sizes. A similar
effect can arise in sexual populations, and is called the Hill-Robertson effect
(Hill and Robertson 1966) or traffic problem (Stephan 1995; Kirby and Stephan 1996).
(See also Crow and Kimura 1965; Kimura and Ohta 1971; Barton 1995; Orr 2000;
McVean and Charlesworth 2000; Gerrish 2001; Johnson and Barton 2002; Kim and Stephan 2003.)



Clonal interference has two main consequences: As the population size
becomes large, the increase in the rate of adaptation with increasing
population size declines, and the beneficial mutations that are fixed convey
increasingly larger beneficial effects. A number of recent studies have tried
to quantify the rate of adaptation and the distribution of beneficial
mutations in various organisms whose predominant mode of replication is asexual,
such as Escherichia coli (de Visser et al. 1999; Imhof and Schlötterer 2001;
Rozen et al. 2002), vesicular stomatitis virus (Miralles et al. 1999; Miralles et al. 2000),
and bacteriophages φX174 and G4 (Bull et al. 2000; Kichler Holder and Bull 2001).

Early studies of clonal interference date back to Kimura and coworkers
(Crow and Kimura 1965; Kimura and Ohta 1971). These authors considered
the same effect s for all beneficial mutations. Gerrish and Lenski (1998) were
the first to consider a distribution of beneficial effects, but neglected
deleterious mutations. The results of Gerrish and Lenski (1998) were later
generalized by Orr (2000) to include deleterious mutations. In the works of both
Gerrish and Lenski (1998) and Orr (2000), the final results (formulae for the
expected substitution rate and for the mean effect of fixed mutations) were
given in the form of unwieldy double integrals, which are difficult to interpret.
[However, Gerrish and Lenski (1998) gave explicit expressions for the
unrealistic case of uniformly distributed beneficial mutations.] From these integrals,
we cannot easily estimate for what parameter settings the interference effect
becomes important, and it is unknown how the speed of adaptation behaves
for very large N. Moreover, even numerical evaluation of the integrals can
be tricky, because the integrand is strongly peaked. Rozen et al. (2002) gave
an explicit expression for the distribution of beneficial effects of fixed
mutations at large N. However, this expression also does not lead to a simple
expression for the mean.

Here, I derive asymptotic expansions for the expected rate of adaptation 
and for the mean beneficial effect of fixed mutations, under the assumption 
that beneficial mutations are distributed exponentially. This assumption 
is reasonable, and has good theoretical support from extreme-value theory 
(Gillespie 1983; Gillespie 1991; Orr 2003). I find that for very large N, the
expected rate of adaptation approaches a limiting value that is given by the 
mean selective advantage of new beneficial mutations. The mean beneficial 
effect of fixed mutations, on the other hand, does not reach a hard limit, but 
continues to grow with the logarithm of the population size. 

MATERIALS AND METHODS 



Model: I consider the model analyzed by Orr (2000). I assume that
haploid organisms replicate asexually, and accumulate both deleterious and
advantageous mutations. The total mutation rate per genome and generation
is U, and the fraction of beneficial mutations is $p_b$. Hence, the beneficial
mutation rate is $U p_b$, and the deleterious mutation rate is $U(1 - p_b)$ ($\approx U$
for small $p_b$). The effects of both beneficial and deleterious mutations are
drawn from probability distributions; all mutations act multiplicatively. I
use a slightly simplified notation for beneficial and deleterious effects of
mutations in comparison to Orr (2000). By s, I denote the effect of a particular
mutation, either beneficial (in which case fitness is increased by a factor $1 + s$)
or deleterious (in which case fitness is decreased by a factor $1 - s$). The mean
effect of beneficial mutations is $s_b$, and the mean effect of deleterious
mutations is $s_d$. The harmonic mean of the distribution of deleterious mutations is
$s_H$. At equilibrium (when all beneficial mutations have gone to fixation), the
frequency of the class of individuals with the highest fitness is approximately
$P_0 = \exp(-U/s_H)$ (Orr 2000).

I assume that beneficial mutations are exponentially distributed, that is,
beneficial effects are drawn from a distribution with probability density
function $f(s) = \exp(-s/s_b)/s_b$. The analytic calculations make no assumption
about the distribution of deleterious mutations, but all simulations have been
carried out with a truncated exponential distribution (see Simulation
methods). I assume that on average deleterious mutations have a much larger
effect than beneficial mutations ($s_b \ll s_d$), such that beneficial mutations
rarely compensate deleterious mutations.

Simulation methods: I carried out simulations of the model described
in the previous subsection. In the simulations, sequences were propagated
in discrete generations. The number of offspring sequences of a sequence i in
the next generation was binomially distributed with mean $w_i/\langle w \rangle$, where $w_i$
is the fitness of sequence i and $\langle w \rangle$ is the average fitness of the population.
Each offspring sequence suffered $k_b$ beneficial and $k_d$ deleterious mutations,
where $k_b$ and $k_d$ were Poisson-distributed with means $U p_b$ and $U(1 - p_b)$,
respectively. Each beneficial mutation increased the fitness of a sequence by
a factor of $1 + s$, where s was drawn from an exponential distribution with
mean $s_b$. Each deleterious mutation decreased the fitness of a sequence by a
factor of $1 - s$, where s was drawn from a truncated exponential distribution
with parameter a. The distribution was truncated both to the left and to the
right. The left truncation was necessary to avoid a zero harmonic mean $s_H$.
(For $s_H = 0$, the predicted frequency of the unmutated individuals is $P_0 = 0$,
and the theory breaks down.) I used as a cutoff for the left truncation the
value 0.01. The right truncation was necessary to avoid negative fitness, and
here I used the cutoff value 1. As parameter for the truncated exponential
distribution, I used a = 0.1, which results in $s_d = 0.11$ and $s_H = 0.05$.
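
The quoted values of the mean and harmonic mean can be checked numerically. The following is a minimal sketch, assuming that the parameter a is the scale of the untruncated exponential (density proportional to exp(-s/a)); the text does not state this explicitly, and the function name is hypothetical.

```python
import math

def truncated_exp_moments(scale=0.1, lower=0.01, upper=1.0, steps=200_000):
    """Mean and harmonic mean of an exponential density truncated to [lower, upper].

    Assumption (not stated in the text): `scale` plays the role of the
    parameter a, i.e. the density is proportional to exp(-s / scale).
    """
    h = (upper - lower) / steps
    norm = mean = inv = 0.0
    for i in range(steps):
        s = lower + (i + 0.5) * h           # midpoint rule
        weight = math.exp(-s / scale) * h
        norm += weight                      # normalization constant
        mean += s * weight                  # accumulates E[s] * norm
        inv += weight / s                   # accumulates E[1/s] * norm
    return mean / norm, norm / inv          # arithmetic mean, harmonic mean

s_d, s_H = truncated_exp_moments()
print(f"s_d = {s_d:.3f}, s_H = {s_H:.3f}")
```

Under this reading of the parameter, both moments come out close to the values given above, which suggests the interpretation of a as a scale is at least consistent with the text.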

I let the population equilibrate for 1000 generations at $p_b = 0$ before
I set $p_b$ to its desired value and started measuring the rate of adaptation.
Simulations were continued for up to 50,000 generations, depending on
population size (the smaller the population size, the longer the simulation run),
and replicated between 5 and 50 times (the smaller the population size, the
more replicates). For each sequence in the population, I kept track of the
number of beneficial mutations it had accumulated. At the end of a
simulation run, I subdivided the final population into classes with equal numbers of
beneficial mutations and determined the most abundant class. The number
of beneficial mutations n in the most abundant class divided by the number
of generations since equilibration $\Delta t$ served as an estimator for the rate of
substitution k. I averaged k over all replicates to obtain the result reported
for E[k]. In order to obtain an estimate for the change in log fitness over
time $d \log w(t)/dt$, I determined the sequence with the least number of
deleterious mutations in the most abundant class, and divided the logarithm of
the sequence's fitness by $\Delta t$. Again, I averaged over all replicates to arrive at
the values reported here. To test whether this approach was comparable to a
direct measurement of the change in population fitness, I fitted for several
exemplary runs a straight line to the logarithm of the average population fitness
as a function of time, from the end of the equilibration time to the end of the
simulation run, and took the slope of that line as the value for $d \log w(t)/dt$.
The differences in the results obtained with these two alternative approaches
were minute.
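
The simulation procedure described above can be sketched in a few lines of Python. This is an illustrative sketch and not the code used for the results reported here: it assumes fixed-size multinomial resampling in place of the binomial offspring numbers, it reads the parameter a as the scale of the truncated exponential, and all function names and default parameter values are hypothetical choices made to keep the run short.

```python
import math
import random
from collections import Counter

def poisson(rng, lam):
    """Poisson variate by Knuth's multiplication method (adequate for small lam)."""
    limit, k, p = math.exp(-lam), 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k

def deleterious_effect(rng, scale=0.1, lower=0.01, upper=1.0):
    """Exponential effect truncated to [lower, upper), by rejection sampling."""
    while True:
        s = rng.expovariate(1.0 / scale)
        if lower <= s < upper:
            return s

def simulate(N=500, U=0.05, p_b=0.05, s_b=0.05, burn_in=100, generations=400, seed=1):
    rng = random.Random(seed)
    # One tuple per sequence: (log fitness, beneficial count, deleterious count).
    pop = [(0.0, 0, 0)] * N
    for gen in range(burn_in + generations):
        frac_b = 0.0 if gen < burn_in else p_b       # equilibrate at p_b = 0
        top = max(ind[0] for ind in pop)
        weights = [math.exp(ind[0] - top) for ind in pop]
        # Assumption: fixed-size multinomial resampling, so that the expected
        # offspring number of sequence i is w_i / <w>.
        parents = rng.choices(pop, weights=weights, k=N)
        pop = []
        for logw, n_b, n_d in parents:
            for _ in range(poisson(rng, U * frac_b)):
                logw += math.log1p(rng.expovariate(1.0 / s_b))   # factor 1 + s
                n_b += 1
            for _ in range(poisson(rng, U * (1.0 - frac_b))):
                logw += math.log1p(-deleterious_effect(rng))     # factor 1 - s
                n_d += 1
            pop.append((logw, n_b, n_d))
    # Most abundant class of beneficial-mutation counts, and within it the
    # sequence with the fewest deleterious mutations.
    n_mode = Counter(ind[1] for ind in pop).most_common(1)[0][0]
    best = min((ind for ind in pop if ind[1] == n_mode), key=lambda ind: ind[2])
    return n_mode / generations, best[0] / generations

k_hat, speed = simulate()
print(f"estimated E[k] = {k_hat:.4f}, d log w/dt = {speed:.5f}")
```

A single short run like this is noisy; averaging over replicates with different seeds, as described above, is needed before comparing with the theory.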

RESULTS 

Expected substitution rate and mean beneficial effect: Beneficial
mutations arise in the populations at rate $N U p_b$. If this rate is small, then
they do not interfere with each other, and independently go to fixation or are
lost to drift. In this case, their expected probability of fixation (averaged over
all possible beneficial effects) is $2 s_b P_0$ (Orr and Kim 1998; Campos 2003),
and therefore the expected rate of substitution E[k] becomes (Orr 2000)

$$E[k] = 2 N U p_b s_b P_0 . \tag{1}$$

When beneficial mutations interfere with each other, then their probability
of fixation is reduced by a factor of $e^{-I(s)}$, with
$I(s) = 2 U p_b P_0 N \ln N \, (s + s_b) \, s^{-1} e^{-s/s_b}$
(Gerrish and Lenski 1998; Orr 2000). I(s) is the expected number
of new mutations of effect larger than s that occur in the time interval
of length $t = (2/s) \ln N$ during which a mutation of effect s goes to fixation.
The form of I(s) that I use throughout this article assumes that beneficial
mutations are distributed exponentially. The general form for arbitrary
distributions is given in (Gerrish and Lenski 1998; Orr 2000). The expected
rate of substitution is obtained by integrating over all beneficial mutations.
Again using the assumption that beneficial mutations are exponentially
distributed, one finds that (Gerrish and Lenski 1998; Orr 2000)

$$E[k] = 2 N U p_b P_0 s_b^{-1} \int_0^\infty s \, e^{-I(s) - s/s_b} \, ds . \tag{2}$$
In Appendix 1, I show that for large N, the substitution rate becomes

$$E[k] \approx \frac{s_b}{\ln N} \left[ \ln(2 U p_b P_0 N \ln N) + 0.5772 \right] . \tag{3}$$

In the limit of $N \to \infty$, this expression simplifies to

$$E[k] \approx s_b , \tag{4}$$

that is, the rate of substitution reaches a hard limit that is given by the mean
beneficial effect of new mutations. Figure 1 shows that the approximation
Eq. (3) works well for intermediate to large N. However, E[k] comes close to
its limiting value $s_b$ only for very large N.
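
The comparison between the integral in Eq. (2) and the closed-form expressions can be reproduced with a simple quadrature. The sketch below assumes the substitution x = s/s_b (as in Appendix 1) and a plain midpoint rule with a fine step, which copes with the peaked integrand at the cost of many function evaluations; the function names and the parameter values in the loop are illustrative assumptions, not values taken from the figures.

```python
import math

EULER_GAMMA = 0.5772

def interference_integral(A, dx=1e-3):
    """Midpoint-rule value of int_0^inf x exp[-A (1 + 1/x) e^{-x} - x] dx."""
    upper = max(50.0, math.log(max(A, 1.0)) + 40.0)   # integrand is negligible beyond
    steps = int(upper / dx)
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        total += x * math.exp(-A * (1.0 + 1.0 / x) * math.exp(-x) - x)
    return total * dx

def substitution_rate_exact(N, U, p_b, s_b, P0):
    """Eq. (2), evaluated after substituting x = s / s_b."""
    A = 2.0 * U * p_b * P0 * N * math.log(N)
    return 2.0 * N * U * p_b * P0 * s_b * interference_integral(A)

def substitution_rate_small_N(N, U, p_b, s_b, P0):
    """Eq. (1): no interference."""
    return 2.0 * N * U * p_b * s_b * P0

def substitution_rate_large_N(N, U, p_b, s_b, P0):
    """Eq. (3): asymptotic expression for large N."""
    A = 2.0 * U * p_b * P0 * N * math.log(N)
    return s_b / math.log(N) * (math.log(A) + EULER_GAMMA)

# Illustrative parameters (assumed): U = 0.01, p_b = 0.001, s_b = 0.01, s_H = 0.05.
params = dict(U=0.01, p_b=1e-3, s_b=0.01, P0=math.exp(-0.01 / 0.05))
for N in (1e2, 1e4, 1e6, 1e8):
    exact = substitution_rate_exact(N, **params)
    print(f"N = {N:.0e}: exact = {exact:.3e}, "
          f"Eq. (1) = {substitution_rate_small_N(N, **params):.3e}, "
          f"Eq. (3) = {substitution_rate_large_N(N, **params):.3e}")
```

Note that Eq. (3) is only meaningful once $2 U p_b P_0 N \ln N$ exceeds one; for smaller N its logarithm is negative and Eq. (1) is the relevant approximation.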

According to Gerrish and Lenski (1998), we can calculate the mean
beneficial effect of fixed mutations E[s] as

$$E[s] = \frac{\int_0^\infty s^2 e^{-I(s) - s/s_b} \, ds}{\int_0^\infty s \, e^{-I(s) - s/s_b} \, ds} . \tag{5}$$

This expression simplifies to (see Appendix 1 for details)

$$E[s] \approx s_b \left[ \ln(2 U p_b P_0 N \ln N) + 0.5772 \right] \tag{6}$$

for large N. Figure 2 shows that this approximation also works very well for
intermediate to large N.

Estimating the onset of clonal interference: For small N, the expected
substitution rate is $E[k] \approx 2 N U p_b P_0 s_b$ [Eq. (1)], while for very large
N, we have $E[k] = s_b$. On the basis of these two equations, we can derive
a simple estimate for the parameter regions in which clonal interference is
relevant: We are certainly in the clonal-interference regime if the estimate of
E[k] for small N exceeds that for large N, that is, if

$$N > \frac{1}{2 U p_b P_0} . \tag{7}$$

This result has a simple interpretation: Clonal interference becomes relevant
if, on average, one beneficial mutation arises in the zero-mutation class at
least every other generation. Note that the mean effect of deleterious
mutations enters this result (through $P_0$), but not the mean effect of beneficial
mutations.

The estimate Eq. (7) is fairly conservative, in the sense that when N
exceeds $1/(2 U p_b P_0)$, we are sure that clonal interference is important, but
clonal interference starts having some effect already for smaller N. In
Appendix 1, I show that an improved estimate is

$$N \ln N > \frac{1}{2 U p_b P_0} . \tag{8}$$

Figure 1 illustrates where the two estimates Eqs. (7) and (8) lie with respect
to the exact expression and the approximations for E[k].

Speed of adaptation: The expected substitution rate is in general not 
an accurate measure for the speed of adaptation, because it disregards the 
beneficial effect of the fixed mutations. A better measure is the change in 
fitness (or log fitness, which is more appropriate for a multiplicative model) 
over time. Clearly, the faster fitness increases, the faster a population adapts 
to its environment. 

As mentioned by Johnson and Barton (2002), the change in log fitness is
given by

$$\frac{d \log w(t)}{dt} = E[k] \log(1 + E[s]) . \tag{9}$$

Using $E[k] = s_b$ and E[s] as given in Eq. (6), we find for large N

$$\frac{d \log w(t)}{dt} \approx s_b \ln\left[ 1 + s_b \ln(2 U p_b P_0 N \ln N) + 0.5772 \, s_b \right] . \tag{10}$$

This equation predicts two different regimes for $d \log w(t)/dt$, depending on
the values of $s_b$ and N. If $s_b$ is much smaller than one, and N is only
moderately large (but sufficiently large such that $E[k] \approx s_b$), then we can
approximate $\ln(1 + E[s])$ with E[s] and find

$$\frac{d \log w(t)}{dt} \approx s_b^2 \left[ \ln(2 U p_b P_0 N \ln N) + 0.5772 \right] . \tag{11}$$

In this regime, the speed of adaptation depends logarithmically on N. If on
the other hand N is extremely large and $s_b$ is not extremely small, then

$$\frac{d \log w(t)}{dt} \approx s_b \ln\left[ s_b \ln(2 U p_b P_0 N \ln N) \right] . \tag{12}$$

In this regime, the speed of adaptation depends double-logarithmically on
N.

For comparison, I now calculate the speed of adaptation for small N. For
small N, clonal interference can be neglected, and therefore the mean
beneficial effect corresponds to the mean effect of beneficial mutations that have
survived drift. The distribution of these mutations is $g(s) = (s/s_b^2) \exp(-s/s_b)$
(Rozen et al. 2002; Otto and Jones 2000), and the mean is $E[s] = 2 s_b$. Using
$\ln(1 + E[s]) \approx E[s]$, we find for small N:

$$\frac{d \log w(t)}{dt} \approx 4 s_b^2 N U p_b P_0 . \tag{13}$$
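
The factor of 4 in Eq. (13) follows from two elementary steps, written out here as a short worked derivation. It assumes, as in the text, that a mutation of effect s survives drift with probability proportional to s, so that the density of surviving mutations is the exponential density f(s) weighted by s and renormalized.

```latex
\begin{align*}
g(s) &= \frac{s\,f(s)}{\int_0^\infty s'\,f(s')\,ds'}
      = \frac{s}{s_b^2}\,e^{-s/s_b}, \\
E[s] &= \int_0^\infty s\,g(s)\,ds
      = \frac{1}{s_b^2}\int_0^\infty s^2 e^{-s/s_b}\,ds
      = 2 s_b, \\
\frac{d\log w(t)}{dt} &\approx E[k]\,E[s]
      = \left(2 N U p_b s_b P_0\right)\left(2 s_b\right)
      = 4 s_b^2 N U p_b P_0 .
\end{align*}
```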



To summarize, the speed of adaptation grows linearly in N for small N, and
logarithmically or double-logarithmically in N for large N. Interestingly,
in the clonal interference regime, growth in the speed of adaptation comes 
from the fixation of mutations with increasingly larger effects, rather than 
from the fixation of increasingly more beneficial mutations. Hence, clonal 
interference slows down the speed of adaptation, but it does not lead to a 
hard speed limit as long as beneficial mutations of increasingly larger effect 
are accessible. 
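
The different regimes can be compared directly by evaluating Eqs. (10), (11), and (13). The following sketch does this for a set of illustrative parameter values, which are assumptions chosen for the example and not values used in the figures; the function names are hypothetical.

```python
import math

def speed_small_N(N, U, p_b, s_b, P0):
    """Eq. (13): speed of adaptation without clonal interference."""
    return 4.0 * s_b**2 * N * U * p_b * P0

def speed_large_N(N, U, p_b, s_b, P0):
    """Eq. (10): E[k] = s_b combined with E[s] from Eq. (6)."""
    A = 2.0 * U * p_b * P0 * N * math.log(N)
    return s_b * math.log(1.0 + s_b * (math.log(A) + 0.5772))

def speed_moderate_N(N, U, p_b, s_b, P0):
    """Eq. (11): Eq. (10) with ln(1 + E[s]) replaced by E[s]."""
    A = 2.0 * U * p_b * P0 * N * math.log(N)
    return s_b**2 * (math.log(A) + 0.5772)

# Illustrative parameters (assumed): U = 0.01, p_b = 0.001, s_b = 0.01, s_H = 0.05.
params = dict(U=0.01, p_b=1e-3, s_b=0.01, P0=math.exp(-0.01 / 0.05))
for N in (1e6, 1e8, 1e10, 1e12):
    print(f"N = {N:.0e}: Eq. (10) = {speed_large_N(N, **params):.3e}, "
          f"Eq. (11) = {speed_moderate_N(N, **params):.3e}")
```

For small $s_b$, Eqs. (10) and (11) stay close over many orders of magnitude in N, which illustrates why the double-logarithmic regime of Eq. (12) is reached only for extremely large N.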
Simulation results: I have carried out extensive simulations to test the
accuracy of the clonal interference theory. Gerrish and Lenski (1998)
mentioned that they found good agreement between theory and simulations, but
they did not report any simulation results or the parameter regions they had
considered. Orr (2000) reported some simulation results, but his simulations
were not in the clonal interference regime [as defined by Eq. ©]. 

Figure El shows the expected substitution rate as a function of the muta- 
tion rate U, both as predicted by Eq. (jS)) and as found in simulations. Below 
the optimal mutation rate $U = s_H$ at which the substitution rate assumes
its maximum (Orr 2000), agreement between theory and data is good over
a wide range of population sizes. Above $U = s_H$, the theory underestimates
E[k]. This effect is caused by the accumulation of slightly deleterious
mutations in the simulations. The theory assumes that only those beneficial
mutations that arise in backgrounds free from deleterious mutations can go
to fixation. However, for large U, sequences that carry one or several slightly
deleterious mutations become so frequent that it becomes likely that one of
them acquires a beneficial mutation of sufficiently large effect to compensate
the deleterious background, and goes to fixation. The degree to which the
theory underestimates E[k] increases as $s_H$ decreases. In the limit of $s_H = 0$,
the theory predicts that E[k] = 0, while simulations show that the true
results (with identical $s_b$ and $s_d$) are not substantially different from those
shown in Fig. 3 (data not shown). Surprisingly, the theory accurately
predicts the change in log fitness $d \ln w(t)/dt$ even in the regime of large U, as
long as $d \ln w(t)/dt$ is not negative (Fig. 3). [For very high U, Muller's ratchet
(Muller 1964; Felsenstein 1974; Haigh 1978; Gordo and Charlesworth 2000)
becomes the predominant force in the dynamic of the evolving population,
and the change in log fitness can assume negative values.] Apparently, in the
regime of large U, the theory underestimates E[k] and overestimates E[s], in
such a way that the two effects nearly cancel each other.

In Fig. I show the change in log fitness as a function of the fraction of 
beneficial mutations $p_b$. Again, we see excellent agreement between theory
and simulation. However, for pb ^ 0.001 the theory underestimates E[k] and 
overestimates E[s], in such a way that the two effects cancel each other (data 
not shown). 

Finally, in Fig. IHl I show the change in log fitness as a function of the 
mean effect of new beneficial mutations $s_b$, while holding the mean effect
of new deleterious mutations constant at $s_d = 0.11$ ($s_H = 0.05$). As
shown in Appendix 1, the theory predicts that both E[k] and E[s] should
depend linearly on $s_b$ for all parameter values. The change in log fitness
should therefore depend quadratically on $s_b$ for $s_b \ll 1$, which means that
$d \ln w(t)/dt$ should appear approximately as a straight line with slope 2 in
the double-logarithmic plot. We see that the simulation data agree very well
with the theory as long as $s_b < s_d$, but start to diverge slowly as $s_b$ grows
larger than $s_d$.

DISCUSSION 

Clonal interference is often said to impose a speed limit on adaptation. 
Here, I have shown that the speed of adaptation, measured as the change in 
log fitness over time, does not reach a hard limit, but continues to grow even 
for very large N. This growth is fueled by the discovery of mutations with 
ever larger beneficial effect in large populations, rather than by an increase 
in the rate of substitutions. 

My results hinge on the assumption that new beneficial mutations are 
exponentially distributed. If beneficial mutations are distributed such that 
large effects are absent, then the rate of adaptation will most likely reach
an upper limit for large N. If on the other hand beneficial effects follow a
distribution with a long tail (such as a power-law or Cauchy distribution), then
the speed of adaptation may grow even faster than predicted by Eq. (10) for
large N. To date, we do not have a good understanding of the true
distribution of beneficial effects in experimental systems. However, an
exponential distribution has good theoretical support (Gillespie 1983; Gillespie 1991;
Orr 2003), has led to good agreement between theory and experiment in
E. coli (Rozen et al. 2002), and overall seems to be a reasonable choice.

Arguments for an exponential distribution of new deleterious mutations 
are not as strong. At the same time, the theory is much less dependent 
on the particulars of the distribution of deleterious mutations. As long as 
we have an accurate expression for $P_0$, and beneficial mutations are unlikely
to compensate deleterious mutations, the theory should work. In practice, 
this means that the theory should work with any distribution that does 
not produce an excessive amount of slightly deleterious mutations. (Neutral 
mutations could be dealt with by considering them as a reduction in the 
overall mutation rate U.) 

De Visser et al. (1999) measured the speed of adaptation in E. coli, vary- 
ing both the population size and the mutation rate each over approximately 
two orders of magnitude. They found that the speed of adaptation did not 
grow in proportion to increases in population size or mutation rate. In fact, 
apart from the experiments carried out at the lowest mutation rate, the speed 
of adaptation changed only very little with population size or mutation rate. 
These results indicate that the populations with the larger size and higher 
mutation rates could not benefit from the additional beneficial mutations 
that must have appeared. The results of de Visser et al. (1999) thus provide
good support for the clonal interference model on a qualitative level.
Quantitatively, however, their data seem to disagree with the model analyzed here:
Fig. 2A of de Visser et al. (1999) suggests that the speed of adaptation runs
quickly into a hard limit, whereas the model predicts that the speed should
continue to grow logarithmically, at least with respect to population size.

There are two reasons that may have caused this discrepancy. First,
de Visser et al. (1999) plotted the speed of adaptation versus the relative
mutation-supply rate (which is the product of population size and relative
mutation rate). Such a plot is problematic, because the mutation-supply rate
does not uniquely specify the speed of adaptation in the clonal interference
model. [In order for the mutation-supply rate to uniquely specify the speed
of adaptation, population size and mutation rate would have to enter the
equations always as a product, which is not the case. In Eq. (10), for example,
the term $P_0$ depends on U but not on N, while the term $\ln N$ does not
depend on U.] The model predicts that the speed of adaptation should
increase with increasing N, whereas it should reach a maximum and then
decrease with increasing U. Thus, a plot of the speed of adaptation versus
population size (at fixed U) is inherently more informative than a plot of
the speed of adaptation versus mutation rate (at fixed N). In the latter
case, a decline in the increase of the speed of adaptation may also indicate
that the mutation rate approaches the optimal mutation rate $U = s_H$. Since
de Visser et al. (1999) studied only two different population sizes (and three
mutation rates), it is not possible to replot a subset of their data versus
population size at fixed U and obtain a quantitative comparison to the model.

Second, the speed of adaptation at a high mutation-supply rate may have
been reduced (thus giving the impression of a hard speed limit) in part
because the populations began to run out of beneficial mutations. De Visser
et al. propagated the populations for 1000 generations, and determined the
speed of adaptation from the total fitness increase over these 1000
generations. In particular for the large population size, it seems that
adaptation slowed down considerably after 500 generations (de Visser et al. 1999,
Fig. 1B). [Note, however, that this argument does not invalidate the overall
conclusion of de Visser et al. (1999). The fitness increase after 200
generations in their Fig. 1 shows strong dependence on the mutation rate for the
small population size, and weak dependence on the mutation rate for the large
population size, which agrees very well with the predictions of the clonal
interference model.] A related explanation for the apparent slowdown in the
speed of adaptation at large population size is that the large populations may
have found mutations of large beneficial effect earlier in the experiments than
the small populations, as predicted by the clonal interference model.

The clonal interference model assumes an infinite supply of beneficial 
mutations, and this assumption is of course unrealistic. Nevertheless, we can 
expect good agreement between model and experiment if the experiment is 
restricted to a relatively short number of generations, or if only the effect of 
the first fixed mutation is measured. An experiment of the latter kind was 
carried out by Rozen et al. (2002), who found that the measured distribution
of beneficial effects in E. coli was in good agreement with the distribution as 
predicted by the clonal interference model. 

The data of Rozen et al. (2002) also allow us to estimate the onset of
clonal interference in E. coli. By fitting the theoretical prediction for the
distribution of beneficial effects to their data, Rozen et al. (2002) derived
estimates for the mean beneficial effect of new mutations $s_b$ and for the beneficial
mutation rate $U p_b$. They found $s_b = 0.024$ and $U p_b = 5.9 \times 10^{-8}$. Having
an estimate for the beneficial mutation rate, we can use Eq. (8) to estimate
the population size in these E. coli populations at which clonal interference
becomes important. Since we do not have a good estimate for the mutational
load in these populations, we set $P_0 = 1$, which means that we neglect the
effect of deleterious mutations. [See Orr (2000) for a discussion of this
problem and its implications for the estimates of $s_b$ and $U p_b$.] As a consequence,
we most likely underestimate the population size at which clonal interference
becomes important. Further, instead of the factor 2 in front of $U p_b$, we use
0.6. This factor takes into account that the E. coli populations fluctuate in
size under standard laboratory conditions; see Rozen et al. (2002), p. 1044.
Thus, we use for our estimate $N \ln N > 1/(0.6 \times 5.9 \times 10^{-8})$. This condition
simplifies to $N > 2 \times 10^6$. [Using a less accurate method based on only
the expected substitution rate and mean beneficial effect of fixed mutations,
Gerrish and Lenski (1998) had earlier derived an estimate of $U p_b = 2 \times 10^{-9}$,
which leads to $N > 4.7 \times 10^7$.] Rozen et al. (2002) carried out their
experiments at an effective population size of $N_e = 3.3 \times 10^7$, which means that
clonal interference probably had an effect on their results. This reasoning
is consistent with the observation that the mean beneficial effect of fixed
mutations is clearly larger than $2 s_b$ in their data (Rozen et al. 2002, Fig. 3).
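
The arithmetic behind these thresholds can be checked with a small root finder. The sketch below solves N ln N = 1/(c U p_b P_0) for N by bisection on a logarithmic scale, with the beneficial mutation rates taken as 5.9 × 10^-8 and 2 × 10^-9, the prefactor c = 0.6, and P_0 = 1 as in the paragraph above; the function name is a hypothetical choice.

```python
import math

def onset_population_size(beneficial_rate, factor=2.0, P0=1.0):
    """Smallest N with N ln N >= 1 / (factor * U p_b * P0), found by bisection."""
    target = 1.0 / (factor * beneficial_rate * P0)
    lo, hi = 2.0, 1e30                # N ln N is increasing on this interval
    for _ in range(200):
        mid = math.sqrt(lo * hi)      # bisect on a logarithmic scale
        if mid * math.log(mid) < target:
            lo = mid
        else:
            hi = mid
    return hi

n_rozen = onset_population_size(5.9e-8, factor=0.6)
n_gerrish_lenski = onset_population_size(2e-9, factor=0.6)
print(f"Rozen et al. estimate:        N > {n_rozen:.2e}")
print(f"Gerrish and Lenski estimate:  N > {n_gerrish_lenski:.2e}")
```

The same function with the default factor of 2 and a value of P_0 below one gives the threshold of Eq. (8) for other parameter sets.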

Clonal interference has not only been studied in E. coli, but also in
vesicular stomatitis virus (VSV). Following de Visser et al. (1999), Miralles et al. (1999)
fitted a linear and a hyperbolic model to the rate of fitness change in VSV as
a function of population size, and found that the hyperbolic model provided
the better fit. However, their data do not plateau at high N, and visual
inspection of their Fig. 1 suggests that a logarithmic model might fit their data
as well. On the other hand, the multiplicity of infection was changed
alongside the population size in these experiments, so that a slow-down in the
speed of adaptation could also be due to increased virus-virus interactions
within cells, rather than clonal interference (Wilke and Novella 2003).

My simulations have shown that the theory of clonal interference works 
well for small to moderate mutation rates, but fails at high mutation rates, 
when Muller's ratchet becomes important. Another effect at high muta- 
tion rates that is neglected in the theory (but was also absent from the 
simulations) is the evolution of mutational robustness: If the distribution 
of deleterious mutations itself can change, then at a high mutation rate
there is a selective pressure to minimize the mutational load of the population
(van Nimwegen et al. 1999; Wilke et al. 2001; Wilke and Adami 2003).
This effect will increase the mean fitness of the population, and will also
increase the potential for further adaptation by increasing $P_0$. In general,
at a high mutation rate we have to consider that a mutation will be combined
with additional mutations on the way to fixation. Therefore, we
cannot simply assume that a mutation of beneficial effect s has probability
of fixation 2s, but have to use fairly complicated mathematical tools
(such as multi-type branching processes) to calculate the fixation probability
(Barton 1995; Johnson and Barton 2002; Wilke 2003; Iwasa et al. 2004).
As a consequence, it is unlikely that we will ever have a simple closed-form
expression for the speed of adaptation at a high mutation rate.

A second regime in which the theory, not surprisingly, breaks down is
when $s_b$ exceeds $s_d$. In this regime, we cannot simply neglect all beneficial
mutations that do not arise on genetic backgrounds free of deleterious
mutations. Johnson and Barton (2002) have recently studied this situation, but
not in the clonal interference regime. In the clonal interference regime, there
are two opposing effects to be considered: On the one hand, the total number
of competing mutations should increase, since now beneficial mutations on
deleterious backgrounds compete for fixation as well. On the other hand, the
total fitness effect of many of the mutations that are competing for fixation
is smaller than what we expect from the distribution of new beneficial
mutations, because the beneficial effects are reduced by deleterious backgrounds.
Without a detailed analysis, it is unclear which of the two effects is more
important. However, if the results from the present study are an indication, in
the clonal interference regime the number of competing mutations will be less
important than the distribution of their beneficial effects, which means that
the present theory should overestimate the rate of adaptation for $s_b > s_d$.
Indeed, I observed exactly this behavior in my simulations (Fig. IHl). 

Whether the assumption < is reasonable is not yet resolved. (Likely, 
the answer to this question will also depend on the particular species under 
study and on the concrete selection regime.) Several authors found that dele- 
terious mutations were frequent but had a very small effect (jMukai et ah 19721 
lUhnishi 1977| Kibota and Lynch 1996 IShabalina and Kondrashov "19971 [Elena and Moya 19990 , 



while others found that deleterious mutations were less frequent but of larger 
effect dKeightley 1996 Fernandez and Lopez- Fanjul 1996| IGarcia-Dorado 1997j 



Keightley and Caballero 1997 ). If the first set of results is representative, 
then beneficial mutations may indeed have on average a larger effect on fit- 
ness than deleterious mutations. While it is reasonably straightforward to 
study the distribution of deleterious mutations in mutation-accumulation ex- 
periments, it is much harder to measure the distribution of new beneficial 
mutations (as opposed to the distribution of fixed beneficial mutations, which 
is skewed towards mutations of large effect). However, evidence from experi- 
mental evolution with viruses shows that in some cases, beneficial mutations 
must have very large effects: Wichman et al. (1999) found a 
several-thousand-fold increase in population growth after ten days of 
selection in phage φX174, and Novella et al. (1995) found fitness increases 
by a factor of 10 or more within five to ten generations in vesicular 
stomatitis virus. In both cases, the observed fitness increase within a very 
short time frame can only be explained by a large supply of beneficial 
mutations of large effect. 

To summarize, in this contribution I have reached the following novel 
conclusions: 

1. The expected rate of substitution approaches the mean beneficial effect 
of new mutations for large N. 

2. The mean beneficial effect of fixed mutations grows logarithmically in 
N for large N. 

3. Clonal interference effects become important if $N \ln N$ is larger than 
$1/(2 U p_b P_0)$. 

4. The speed of adaptation grows logarithmically in N for moderately 
large N, and double-logarithmically for extremely large N. 

5. For large N, the speed of adaptation is limited by the distribution of 
beneficial effects of new mutations rather than by the supply rate of 
new mutations. 
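
As a rough numerical illustration of conclusions 1, 2, and 4, the following 
Python sketch combines the large-N approximations of Appendix 1, Eqs. (15) 
and (17), with the prediction E[k] ln(1 + E[s]) for the change in log fitness 
that is used in the figure captions. The function name and the argument 
`mu_b` (standing for the product U p_b P_0) are assumptions made for this 
example, and the formulas are only meaningful when 2 U p_b P_0 N ln N is 
large. 

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler constant


def speed_of_adaptation(N, mu_b, s_b):
    """Large-N estimate of d ln w / dt = E[k] * ln(1 + E[s]).

    mu_b is an assumed name for the product U * p_b * P_0.
    """
    A = 2.0 * mu_b * N * math.log(N)
    if A <= 1.0:
        raise ValueError("the asymptotic formulas require large A")
    log_term = math.log(A) + EULER_GAMMA
    rate = s_b * log_term / math.log(N)  # Eq. (15)
    effect = s_b * log_term              # Eq. (17)
    return rate * math.log1p(effect)


if __name__ == "__main__":
    # Illustrative values only: U = 0.04, p_b = 0.0001, and P_0 set to 1.
    for N in (1e6, 1e9, 1e12):
        print(f"N = {N:.0e}: speed = {speed_of_adaptation(N, 4e-6, 0.01):.3e}")
```

With these illustrative inputs the estimated speed keeps increasing with N, 
but far more slowly than N itself. 
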
APPENDIX 1 



Expected substitution rate and mean beneficial effect: First, we 
notice that $E[k]$ depends linearly on the mean effect of beneficial mutations: 
Substituting $x$ for $s/s_b$ in the integral expression for $E[k]$ and writing 
$A = 2 U p_b P_0 N \ln N$, we find 

\[ E[k] = \frac{s_b A}{\ln N} \int_0^\infty x \exp\left[-A(1 + 1/x)e^{-x} - x\right] dx . \tag{14} \]

Therefore, the shape of $E[k]$ as a function of $N$ or $U$ is independent of the 
value of $s_b$. 
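
The integral in Eq. (14) has no closed form, but it is cheap to evaluate 
numerically. The following Python sketch does so with a composite Simpson 
rule. The function name, the argument `mu_b` (standing for the product 
U p_b P_0), the prefactor s_b A / ln N (reconstructed so that the result 
matches Eq. (15)), and the cutoff `x_max` are assumptions of this example, 
not part of the derivation. 

```python
import math


def expected_substitution_rate(N, mu_b, s_b, x_max=60.0, steps=20000):
    """Numerically evaluate E[k] from Eq. (14).

    mu_b is an assumed name for the product U * p_b * P_0.
    steps must be even for Simpson's rule.
    """
    A = 2.0 * mu_b * N * math.log(N)

    def integrand(x):
        if x == 0.0:
            return 0.0  # the factor x vanishes at the lower limit
        return x * math.exp(-A * (1.0 + 1.0 / x) * math.exp(-x) - x)

    # Composite Simpson rule on [0, x_max]; the tail beyond x_max is
    # negligible because the integrand decays like x * exp(-x).
    h = x_max / steps
    total = integrand(0.0) + integrand(x_max)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    integral = total * h / 3.0

    return s_b * A / math.log(N) * integral
```

For small A the integral is close to one, so the function returns 
approximately 2 U p_b P_0 N s_b, and for large A the result stays below that 
value because of the interference term in the exponent. 
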

We are interested in an asymptotic expansion of the integral in Eq. (14) 
for large $N$. For the asymptotic expansion to work, $N$ needs to be so large 
that $A$ is large. Clearly, for any given $U p_b P_0$, we can always choose $N$ 
sufficiently large such that $A$ is large. Because of the exponential factor in 
the integrand, the main contribution to the integral comes from values of $x$ 
for which $A(1 + 1/x)e^{-x} + x$ is small. Since the first term decays 
exponentially with $x$ while the second term grows linearly, in general the 
main contribution to the integral will come from small $x$. However, for large 
$A$, the first term becomes small only when $x$ is substantially larger than 
one. In this regime, we can neglect the term $1/x$, and the integral in 
Eq. (14) is then identical to the integral $J_n(A)$ defined in Appendix 2 with 
$n = 1$. Using the expression for $J_1(A)$ given in Eq. (22), we find 

\[ E[k] \approx \frac{s_b}{\ln N}\left[\ln(2 U p_b P_0 N \ln N) + \gamma\right] . \tag{15} \]

In the limit of very large $N$, we obtain the even simpler expression 
$E[k] \approx s_b$. 
Using a reasoning similar to that for the expected substitution rate, we 
find that the expected beneficial effect $E[s]$ [as defined in the main text] 
also depends linearly on $s_b$, and simplifies for large $N$ to 
$E[s] = s_b J_2(A)/J_1(A)$ (again with $A = 2 U p_b P_0 N \ln N$). Using the 
expressions given in Eqs. (22) and (23), we find 

\[ E[s] \approx s_b \left[ \ln(2 U p_b P_0 N \ln N) + \gamma + \frac{\pi^2/6}{\ln(2 U p_b P_0 N \ln N) + \gamma} \right] . \tag{16} \]

In the limit of very large $N$, the second term disappears, and we end up with 

\[ E[s] \approx s_b\left[\ln(2 U p_b P_0 N \ln N) + \gamma\right] . \tag{17} \]
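
To get a feeling for how quickly the correction term becomes negligible, the 
following Python sketch evaluates the ratio s_b J_2(A)/J_1(A) with the 
expressions of Eqs. (22) and (23) and compares it with Eq. (17). The function 
name and the argument `mu_b` (standing for U p_b P_0) are assumptions of this 
example. 

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler constant


def mean_fixed_effect(N, mu_b, s_b):
    """Return (E[s] with the pi^2/6 correction, leading-order E[s] of Eq. (17)).

    mu_b is an assumed name for the product U * p_b * P_0.
    """
    A = 2.0 * mu_b * N * math.log(N)
    log_term = math.log(A) + EULER_GAMMA
    # s_b * J_2(A) / J_1(A) using Eqs. (22) and (23)
    with_correction = s_b * (log_term + (math.pi ** 2 / 6.0) / log_term)
    leading_order = s_b * log_term  # Eq. (17)
    return with_correction, leading_order
```

For example, with N = 10^9, `mu_b` = 4e-6, and s_b = 0.01 (illustrative 
values with P_0 set to 1), the two estimates differ by roughly one percent. 
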

Estimating the onset of clonal interference: We can derive an estimate 
of the parameter region in which clonal interference becomes important by 
calculating the point at which the approximation for $E[k]$ for small $N$, 
$E[k] \approx 2 U p_b P_0 N s_b$, comes the closest to the approximation for 
$E[k]$ for large $N$, Eq. (15). Since the shape of $E[k]$ is not influenced by 
$s_b$ (see above), we can set $s_b = 1$ for this calculation. Further, we write 
$C = 2 U p_b P_0$. Now, we have to find the minimum of the function 

\[ g(C, N) = CN - \frac{\ln(CN \ln N) + \gamma}{\ln N} . \tag{18} \]

We find $\partial g(C, N)/\partial C = N - 1/(C \ln N)$, which leads to the 
condition 

\[ N \ln N > \frac{1}{C} \tag{19} \]

for the onset of clonal interference. (Differentiating with respect to $N$ 
yields approximately the same condition.) This condition cannot be solved for 
$N$ in a closed-form expression, but is easy to evaluate numerically. 
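
One possible way to carry out this numerical evaluation is a bisection on 
ln N, sketched below in Python. The function name and the iteration count are 
assumptions of this example, and any root finder would serve equally well. 

```python
import math


def onset_population_size(C, iterations=200):
    """Return the N that solves N * ln(N) = 1 / C, where C = 2 * U * p_b * P_0."""
    if C <= 0.0:
        raise ValueError("C must be positive")
    log_target = -math.log(C)  # ln(1 / C)
    # With y = ln N the condition reads y + ln(y) = ln(1 / C); the left-hand
    # side increases monotonically in y > 0, so bisection is safe.
    lo, hi = 1e-300, max(1.0, log_target) + 1.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mid + math.log(mid) < log_target:
            lo = mid
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))
```

For instance, C = 8e-6 (U = 0.04 and p_b = 0.0001 as in Fig. 1, with P_0 set 
to 1 purely for illustration) gives an onset near N = 1.3 x 10^4. 
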

APPENDIX 2 

Integrals: For the asymptotic expansion, we have to solve integrals of 
the form 

\[ J_n(A) = \int_0^\infty x^n \exp(-A e^{-x} - x)\, dx , \tag{20} \]

in particular for the cases $n = 1$ and $n = 2$. After substituting 
$z = A e^{-x}$, we obtain 

\[ J_n(A) = \frac{1}{A} \int_0^A (\ln A - \ln z)^n e^{-z}\, dz
          = \frac{1}{A} \sum_{k=0}^{n} \binom{n}{k} (\ln A)^{n-k} (-1)^k \int_0^A (\ln z)^k e^{-z}\, dz . \tag{21} \]

The main contribution to the remaining integral comes from small $z$, while 
$A$ is large in the cases considered here. Therefore, we can replace the upper 
limit of integration with $\infty$. For the three relevant cases $k = 0$, 
$k = 1$, and $k = 2$, the integrals are $\int_0^\infty e^{-z} dz = 1$, 
$\int_0^\infty \ln z \, e^{-z} dz = -\gamma$, and 
$\int_0^\infty (\ln z)^2 e^{-z} dz = \gamma^2 + \pi^2/6$, where 
$\gamma \approx 0.5772$ is the Euler constant. Thus, we find approximately 

\[ J_1(A) = (\ln A + \gamma)/A , \tag{22} \]
\[ J_2(A) = \left[(\ln A + \gamma)^2 + \pi^2/6\right]/A . \tag{23} \]
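
The quality of Eqs. (22) and (23) can be checked against direct numerical 
integration of Eq. (20). The Python sketch below uses a composite Simpson 
rule. The function names, the cutoff `x_max`, and the number of steps are 
assumptions of this example. 

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler constant


def j_numeric(n, A, x_max=80.0, steps=40000):
    """Simpson's rule for J_n(A) = int_0^inf x^n exp(-A e^{-x} - x) dx.

    The integral is truncated at x_max; steps must be even.
    """
    def f(x):
        return x ** n * math.exp(-A * math.exp(-x) - x)

    h = x_max / steps
    total = f(0.0) + f(x_max)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0


def j_asymptotic(n, A):
    """Large-A approximations of Eqs. (22) and (23)."""
    log_term = math.log(A) + EULER_GAMMA
    if n == 1:
        return log_term / A
    if n == 2:
        return (log_term ** 2 + math.pi ** 2 / 6.0) / A
    raise ValueError("only n = 1 and n = 2 are covered by Eqs. (22) and (23)")
```

Because the only approximation is the extension of the upper limit from A to 
infinity, the two functions should agree closely once A is moderately large, 
for example A = 1000. 
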

Figure 1: Expected substitution rate $E[k]$ versus population size $N$ 
($U = 0.04$, $p_b = 0.0001$, $s_b = 0.01$, $s_d = 0.05$). The thick solid line 
stems from 
exact numerical evaluation of the full expression for $E[k]$, and the thick 
dashed line corresponds to an approximation of it. The thin solid lines 
correspond to the approximations for small and large $N$. The dash-dotted line 
and the dotted line indicate the onset of clonal interference according to two 
different estimates. 

Figure 2: Mean beneficial effect of fixed mutations $E[s]$ versus population 
size $N$ ($U = 0.04$, $p_b = 0.0001$, $s_b = 0.01$, $s_d = 0.05$). The solid 
line stems from exact numerical evaluation of the full expression for $E[s]$, 
and the dashed line corresponds to the approximation Eq. (16). 

Figure 3: Expected substitution rate $E[k]$ versus mutation rate $U$ 
($p_b = 0.0001$, $s_b = 0.01$, $s_d = 0.05$). Results are shown for three 
population sizes $N$ (from bottom to top). Solid lines indicate the 
theoretical prediction Eq. (2), and points are simulation results. 

Figure 4: Change in log fitness $d \ln w(t)/dt$ versus mutation rate $U$ 
($p_b = 0.0001$, $s_b = 0.01$, $s_d = 0.05$). Results are shown for three 
population sizes $N$ (from bottom to top). Solid lines indicate the 
theoretical prediction $E[k] \ln(1 + E[s])$, and points are simulation 
results. For $U = 0.4$, Muller's ratchet led to a negative $d \ln w(t)/dt$ in 
two of the three populations. The corresponding two data points are therefore 
missing from this figure. 

Figure 5: Change in log fitness $d \ln w(t)/dt$ versus fraction of beneficial 
mutations $p_b$ ($U = 0.02$, $s_b = 0.01$, $s_d = 0.05$). Results are shown 
for three population sizes $N$ (from bottom to top). Solid lines indicate the 
theoretical prediction $E[k] \ln(1 + E[s])$, and points are simulation results. 

Figure 6: Change in log fitness $d \ln w(t)/dt$ versus mean effect of new 
beneficial mutations $s_b$ ($p_b = 0.0001$, $U = 0.02$, $s_d = 0.05$). Results 
are shown for three population sizes $N$ (from bottom to top). Solid lines 
indicate the theoretical prediction $E[k] \ln(1 + E[s])$, and points are 
simulation results. In the shaded region, the mean effect of new beneficial 
mutations $s_b$ exceeds the mean effect of new deleterious mutations $s_d$. 